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Abstract. The recurrence rate and determinism are two of the basic complexity measures studied in the 
recurrence quantification analysis. In this paper, the recurrence rate and determinism are expressed in terms 
of the correlation sum, and strong laws of large numbers are given for them. 



1. Introduction 

The notion of recurrence is one of the fundamental notions in the theory of dynamical systems. Recurrence 
plots, introduced by Eckmann, Kamphorst and Ruelle [4] in 1987, provide a powerful tool for recurrence 
visualization. The recurrence plot of the trajectory xq, Xi, . . . , a;„_i of a point x = Xq is a, black-and-white 
image with a pixel (i, j) being black if and only if the trajectory at time j recurs to the state at time «; that 
is, the points close to each other. The recurrence plot provides a two-dimensional representation 

of an (arbitrary-dimensional) dynamical system. 

The quantitative study of recurrence plots, called recurrence quantification analysis (RQA), was initiated 
by Zbilut and Webber in [18], where the authors introduced several measures of complexity based on the 
recurrence plot. Among them, the recurrence rate RR and the determinism DET are probably the most 
important and widely used ones. Their definitions are based on diagonal lines (that is, segments of black 
points parallel to the main diagonal), which correspond to recurrences of parts of the trajectory. 

Since the seminal paper [18], new RQA tools, quantities and modifications were introduced and recurrence 
quantification has been applied in many areas of science, cf. [12] and [10], among others. 

Despite its wide use, theoretical properties of recurrence measures were studied rarely. Asymptotic 
properties of RQA characteristics were studied e.g. in [6, 3, 5, 16, 19]. The correlation sum, tightly connected 
with the recurrence rate, as well as derived quantities such as the correlation integral, correlation dimension 
and correlation entropy, were studied extensively, cf. [9]. One of the fundamental results, namely the strong 
law for correlation sums of ergodic processes, was proved (by different methods and under different conditions) 
in [14, 13, 1, 15, 11]. It states that, for a separable metric space {Z,d) and a /i-ergodic dynamical system on 
it, the correlation sum of the trajectories of almost every point x Cz Z with every (up to countably many) 
r > converges to the correlation integral 

(1) lim C(x, n, r) = c(r), where c{r) = fi x fi{(x , y) : d{x,y)<r} 

n— ^cxD 

and C{x,n,r) = (1/n^) • card{(i,j) : < i,j < n, d{xi,Xj) < r}. Recall that the correlation integral c(r) 
is just the probability P{d{X,Y) < r} that two independent random variables X,Y with distribution fi are 
r-close. It is worth noting that the correlation sum, which measures the level of dependence in a trajectory, 
asymptotically turns into the probability of closeness of two independent random variables. 

The main purpose of the present paper is to study asymptotic properties of RQA characteristics for 
ergodic processes. We start with a proof of a simple formula giving an expression of the recurrence rate via 
correlation sums, see Proposition 1: 

(2) RRr-fc.Cr-(fc-l)-Cr+i, 

where m is the embedding dimension, k is the prediction horizon and CJ? and RR^* denote the correlation 
sum and recurrence rate, respectively; for the corresponding definitions see Section 2. The relationship (2) 
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directly permits to express the determinism DET in terms of the correlation sums 



(3) 



fc • - (fc - 1) • c;:vi 



RR'i" 

The bridging formulas (2) and (3) enable us to derive strong laws of large numbers for the recurrence 
rate and determinism from that for the correlation sum, see Theorems 4 and 5. To this end, however, we 
need to generalize (1) to the case when d is a pseudometric on rather than a metric. For pseudometrics 
induced by Borel maps, this problem was studied in [15, Theorem 2]. In the general case, (1) was proved by 
Manning and Simon [11]; for details, see Theorem 19 in Section 7. 

We apply the strong laws to iid processes, Markov chains and autoregressive processes, and derive explicit 
formulas for the recurrence integral, asymptotic determinism and mean diagonal line length of these processes; 
see Table 1 and Section 4. On simulated data we demonstrate the speed of convergence of RQA quantities 
when the length n of (the beginning of) the trajectory goes to infinity. 





IID 




Markov chain 


recurrence integral 


^fe+m-l _ 


- 1)"] 


Q,^fc+m-2 [fc _ (fc _ 1) ^] 


asymptotic determinism 


[k-{k- 


l)a] 


[k-{k-l)l3] 


mean diagonal line length 


k + a/{l - 


a) 


fc + /3/(l-/3) 



Table 1. Formulas for RQA asymptotics for iid processes and Markov chains; here a = c(r) 
and /3 — C2(r)/c(r). 



Further, in Section 5 we give an example showing that higher entropy of a process does not necessarily 
mean smaller (asymptotic) determinism and that an iid process can have higher determinism than a non-iid 
one with the same one-dimensional marginals. This is a rather unexpected behavior since, in a sense, entropy 
and determinism are opposite notions. 

We also discuss the problem of choosing the distance threshold r. In the literature, the distance threshold 
is selected such that, for the embedding dimension m, the recurrence rate attains a fixed level. This rule, 
however, can lead to the existence of the so-called spurious structures in recurrence plots of iid processes, as 
noted in [17], see also [12, Section 3.2.4]. In Section 6 we show why this happens. For a large embedding 
dimension m and distance threshold selected by this rule, the determinism det™(rm) is close to one even 
for iid processes. Hence, the appearance of spurious structures is a direct consequence of the selection rule 
fixing the recurrence rate. 

The explicit formula for the asymptotic determinism det™ can be stated in terms of conditional probabil- 
ities that k or {k + 1) consecutive recurrences occur given that one recurrence has occurred; see Theorem 10. 
Hence, if the process under consideration is a Markov one of order p, then over-embedding to dimension 
m > p leaves the asymptotic determinism unchanged; see Corollary 11. This is demonstrated in Section 4.3 
for autoregressive processes. There we discuss possible use of RQA characteristics for estimation of the order 
of such processes. 

The paper is organized as follows. In Section 2 we recall the definitions of RQA measures and we prove (2), 
see Proposition 1. The strong laws are stated in Section 3 and, in Section 4, they are applied to iid processes, 
Markov chains and autoregressive processes. Relationship between entropy and asymptotic determinism is 
discussed in Section 5 and the so-called spurious structures in recurrence plots of iid processes are explained 
in Section 6. In Section 7 we discuss the strong law for correlation sums on pseudometric spaces. 

2. Recurrence quantification analysis (RQA) and correlation sums 

In this section we recall the definitions of basic RQA measures and of the correlation sum for (embedded) 
trajectories of a general iS-valued process. To make the notation easier, we write a;™, as a shorthand for 
(^^) (^0^., respectively 

Let S — (S*, g) be a metric space. Fix an integer m > 1 called the embedding dimension. Let S*™ be the 
embedding space of w-tuples s™ equipped with a metric compatible with the product topology. Natural 



STRONG LAWS FOR RECURRENCE QUANTIFICATION ANALYSIS 



3 



choices for gi™ are the Manhattan (Li), Euchdean {L2) or Chebyshev (L^) metrics, the latter given by 

(4) max g{s,,t,), 

Q<j<m 

but in general we do not restrict to be one of these. 

Let S°° denote the space of all sequences of points from S. This product space is usually equipped 
with a metric, say with e°°(a;g°, yg") = J2i • niinjl, Q{xi,yi)}. In practice, however, we know just (finite) 
beginnings of trajectories and thus we are not able to compute the distance exactly. That is why we use 
pseudometrics instead, depending only on the first members of sequences. For an integer > 1, called the 
prediction horizon, a pseudometric d™ on S°° is defined by 

(5) dT{x^,y^)=m'^xp"'{xT,yT)- 

Q<i<k 

For A; = 1 we write simply d™ instead of d™. Notice that d™ depends only on the first (to + fc — 1) members 
of x'^,y^. 

2.1. RQA measures. Fix a sequence x = x'^ £ S°° and consider the embedded trajectory x = ig", 
Xi = e S*™. Fix also a distance threshold r > 0. For z, j € N (here N stands for the set of non-negative 
integers {0, 1, . . . }) we say that the couple {i,j) is an r-recurrence (in the m'th embedding of the trajectory 
of x) if 

d"^{xT,xf)^e^^{xT.xf)<r. 

The recurrence plot of dimension n is a square n x n matrix of zeros and ones, with the entry at {i,j) 
(0 l£ i,j < n) equal to one if and only if {i,j) is a recurrence. Usually, the recurrence plot is visualized by a 
black-and-white image, with black pixels representing recurrences. Let us note that to construct the n x n 
recurrence plot (in the TO'th embedding) one needs to know only the first (n -I- m — 1) members Xq^™^^ of 

X. 

Diagonal lines are basic patterns in the recurrence plot. We say that (i,j) is a beginning of a diagonal 
line of length fc > 1 in the n x n recurrence plot if the following are true: 

• ^ i, j ^ n — k; 

• {i + h,j + h) is a recurrence for every < h < k; 

• either at least one of i,j is equal to or (i — 1, j — 1) is not a recurrence; 

• either at least one of i + fc, j + fc is equal to n or {i + fc, j + k) is not a recurrence. 

For 0<i,j<n— k this is equivalent to 

d^ixr,x^)<r, d^{xT.^,xf_,)>r and d^{xT+k.xf+^) > r. 

The number of lines of length k in the n x n recurrence plot is denoted by L™ = L™(x,n, r). Notice that 
the main diagonal line (i.e. the case i = j) is not excluded, thus L™(2:, n, r) — 1; further, L™{x, n,r) = for 
every k > n. 

Now fix the prediction horizon k > 1. The k-recurrence rate RR™ is the percentage of recurrences 
contained in diagonal lines of length at least fc; that is, 

(6) RRI" = RRI" (x, ^, r) = ^ ^ / . Lr. 

l>k 

The k-determinism DET™ is the ratio of the fc-recurrence rate and 1-recurrence rate 

(7) DETl"=DETr(x,n,r) = ^ 

(here and throughout we always assume that the denominator is non-zero; otherwise we leave the corre- 
sponding quantity undefined). The k-average line length LAVG™ is the average length of diagonal lines not 
shorter than k 

RR™ 

again, this characteristic depends also on x, n, r. For the definitions of other RQA characteristics, such as 
the (Shannon) entropy of diagonal line length, trend or measures based on vertical lines, see e.g. [12]. 
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2.2. Correlation sum. Tightly connected with the recurrence rate is the notion of correlation sum, studied 
by Grassberger and Procaccia [8, 7] in relation to the correlation dimension. For a sequence x = x"^ e S°° , 
the embedding dimension m, the prediction horizon k, the distance threshold r > and n > I, the correlation 
sum is defined by 



(9) 



Cr = Cr(x,n,r) = ^card{(z,j): 0<i,j<{n~k), dTixT < r}. 

n+m— 1 



Here, as above, the quantity depends only on the beginning Xq of x. C™ measures the relative frequency 
of recurrences (in the m'th embedding) followed by at least {k — 1) other recurrences. Since, in a diagonal 
line of length I > k, just the first {I — k + 1) points are followed by (fc — 1) other recurrences, it immediately 
follows that 



(10) 



l>k 



for every m, fc > 1. Comparison with (6) gives the next statement. 
Proposition 1. Form,k > 1, 



k ■ Ci 



(fe-l)-C 



fe+i- 



Proof. By (10) and (6) we have 



i^RRil 



= k 



l>k+l 



l>k+l 



kn'cT - (fc - 1) E - 



/>A;+1 



= kn'CT^ik-l)n'CZ\„ 
from which the assertion immediately follows. 

Validity of the previous relation can be also seen from the following picture 



□ 



a b:\b\ = k 

of a diagonal line of length Z = |a| + |6| > fc. The a-dots are counted in the ^-recurrence rate as well as in 
both the fc and (fc + l)-correlation sum. On the other hand, all of the &-dots are counted in the fc-recurrence 
rate, but only the first one is counted in the fc-correlation sum and none in the (fc + l)-correlation sum. This 
gives RR™ — fc(C™ — C™_^-^) + C^j^, which is equivalent to the formula from Proposition 1. 

As a corollary of Proposition 1 we can immediately obtain a formula for the determinism in terms of 
correlation sums. Since (10) gives {^/n'^)J2i>k^r = ^k 



(11) 



LAVGI; 



" we also obtain that 

Cm 
k + 1 



CV 



c 



fc+1 



As was noted by many authors, if the metric in the embedding space is the Chebyshev one (see (4)), 
the embedded recurrence quantities can be expressed in terms of the non-embedded ones. Let us formulate 
this as a lemma; there, L^, C^, RR^ stand for L^, C^, RR^, respectively 



Lemma 2. Let m, fc > 1 and g"^ be given by (4). Then 



L™(x,n,r) = Lhix,n',r) 

and 

RRT {x,n,r)^ 
where h — k + m — 1 and n' — n -\- m — \ . 



Cr(x,n,r) = 



Gh[x,n\r) 



\RRh{x,n ,r) - (m - 1) • {Ch{x,n ,r) - Ch+i{x,n ,r))\ , 
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Proof. By (4), (f^{x°° ,x°°) < r if and only if g{xi+i, Xj+i) < r for every < I < h. Thus the first equahty is 
an immediate consequence of the definition of diagonal lines and the second one follows from (10). Further, 
(6) gives 

n2RR™(a;,n,r) = ^ (/' - (to - 1)) • L,,(x, n', r) 

l'>h 

= (n')' • RRh{x, n', r) - (to - 1) • ^ Lh{x, n' , r). 

l'>h 

Hence, using (10), also the third formula is proved. □ 

3. Strong laws for RQA 

Here, among other results, we formulate and prove strong laws of large numbers for the recurrence rate 
and determinism. First, the necessary notions and results are summarized. By a space we always mean a 
topological space. 

3.1. Preliminaries. Let Z be a space and Bz be the Borel tr-algebra on Z. A (measure-theoretical) 
dynamical system is a quadruple {Z, Bz, ^J■^ T), where /i is a probability measure on (Z, Bz) and T : Z — > Z is 
a (Borel) measurable map which preserves fi, that is, fi{T~^{B)) = fJ,(B) for every B S Bz- A set i? € Bz is 
said to be T -invariant if T^^{B) — B. We say that T is ^-ergodic or that /i is T-ergodic if /Lt(B) G {0, 1} for 
every T-invariant set B. For n E N, the n-th (forward) iterate T" of T is defined recursively by T° = id^ and 
T"+i = T o T". For m, n > and x e Z we write T™ and T^(x) instead of (T')1+^-'^ and (r*(a;))"+7-\ 
respectively. 

Let S" be a space with the Borel cr-algebra Bs- On the product space S°°, the Borel cr- algebra is denoted by 
B'g'. An S -valued (discrete time) stochastic process is a sequence X = of random variables Xn ; 51 — )• S* 
(n £ N) defined on a probability space (17, B, P). The distribution of the process X is the measure /i = ^.x 
on (5*°°,^^) defined by /i(F) = P{X^ e F}. 

The (left) shift on is the (continuous) map T : ^ S°° defined by 

T (xjf ) = , where y„ = x^+i for every n e N. 

Let 77 : 5"°° — >■ 5 denote the projection onto the zeroth coordinate, that is, tt (xjf ) = xq. If X is a stochastic 
process with distribution /i, then the shift T together with the projection tt and the measure ^ form the 
Kolmogorov representation of the process X. From now on we always assume that X is directly given by its 
Kolmogorov representation, that is, 

{n,B,P) = {S°°,B'^,^l) and X„ = 7roT". 

A process X is (strictly) stationary if its distribution is T- invariant. The marginal of a stationary process 
X^ is the distribution of Aq. A process X is ergodic if every T-invariant event has probability either or 1. 
Thus, a process X is stationary and ergodic if and only if the dynamical system {S°° ,B'^ , ^,T) is ergodic. 

3.2. Strong law for correlation sum. For a Borel measure /i on S°° , m,k > 1 and r > define the 
correlation integral c™(r) by 

(12) cr(r)-MX/.{(a;,y): dZ\x,y)<r}. 

If /i is the distribution of a process A^, then c™(r) is the probability that, for two independent random 
vectors y(f^+'"^\ Z^'+""^ with the distribution equal to that of Xl;+"'-\ every F/", (i < k) are r-close 
according to p™: 

(13) c^(r) = ^ {e"(Kr, < r for every < i < fc} . 

The following theorem, the proof of which is postponed to Section 7, follows from [11]. 

Theorem 3. Let S be a separable metric space, X be an S -valued ergodic stationary process with distribution 
/i and TO, fc > 1 be integers. Then for ji-a.e. trajectory x S S°° of X and for every r > 

(14) hm Cr(x,n,r) = cr(r)>0 
provided is continuous at r. 
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Notice that c™ is right continuous and non-decreasing, so it has at most countably many discontinuities; 
it is continuous at r if and only if /i x ^{{x,y) : — r} is zero. Further, the convergence in (13) is 

uniform over r on any compact interval on which c™ is continuous. 

3.3. Strong laws for RQA. The purpose of this section is to show that the basic RQA characteristics 
converge almost surely to constants, which depend only on the distribution ^ of the (ergodic) process and 
on the distance threshold r. To formulate the results, we introduce the recurrence integral rr™, asymptotic 
determinism det™ and mean diagonal line length lavg™ for every r by 



^ ^ clet^(r) = and 

(15) ^ ' rr™(r) 



rrr(r) = fc.cr(r)-(A:-l).cr+i(r), 
and 



lavKl"(r) = fc + , ^ , , , 

^'^^ c™(r)-c™i(r)' 

if k is such that c™(r) = c^-^(r) > we put lavg™(r) = oo. Thus all the quantities are defined for every 
r > 0. 

Proposition 1 and Theorem 3 immediately give the following theorem. 

Theorem 4 (Strong law for recurrence rate). Under the assumptions of Theorem 3, for fi-a.e. x G 5*°° and 
for every (up to countably many) r > 0, 

lim RR™(a;,n, r) =rr™(r). 

Theorem 5 (Strong laws for DET and LAVG). Under the assumptions of Theorem 3, for fi-a.e. x E S°° 
and for every (up to countably many) r > 0, 

lim DET^(a;, n, r) = det'fe"(r) and lim LAVG^(x, n, r) = lavg^(r). 

Proof. The statements follow since a.e. -convergence is preserved by elementary arithmetic operations pro- 
vided that, for division, the numerator or denominator is non-zero. □ 

Remark 6. Theorem 3 can be trivially used to derive strong law also for another RQA quantity called the 
k-ratio defined by RATIO" = DET^/RR™. Further, for the maximal diagonal line length LMAX" defined 

by 

LMAX'" = LMAX™(a;, n, r) = max{? < n : L]^{x, n, r) > 0} 
using Birkhoff ergodic theorem one can easily show that, under the assumptions of Theorem 3, 

lim LMAX" (x,n,r) = oo 

for /i-a.e. x G S°° and for every r > 0. As a corollary we immediately have that the reciprocal value 
DIV™ = l/LMAX™ called the divergence converges almost surely to zero. 

Remark 7. Recurrence measures as well as correlation sums are often defined using strict inequalities 
d^{x, y) < r, and/or with excluding the main diagonal i = j. Clearly, the latter has no effect on asymptotic 
properties, that is, Theorems 3-5 remain true also in this case. When one uses strict inequalities, then again 
the results are valid provided strict inequality is used also in the definition (12) of the correlation integral. 
The relationship between this new "open" correlation integral and the used "closed" one is straightforward, 
see [13, Remark 2.2]. 

Remark 8. As can be seen from Theorem 19, Theorem 3 is valid with replaced by any separable Borel 
pseudometric d on S°° . For example, d can be defined via order patterns (cf. [2]): d(a;Q°,t/g°) = 1 if 
a;™,y™ have the same order pattern, d{x^,y^) = otherwise. In this way we obtain strong laws for RQA 
characteristics based on order patterns recurrence plots. 

As for "empirical" RQA quantities (see Lemma 2), the dependence of asymptotic ones on the embedding 
dimension m is straightforward provided the maximum metric is used. 
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Lemma 9. Let m, k > 1, r > and be given by (4)- Then 

c™ = c,i, rr" = rr/i - (m - l)(c/i - c/i+i) and lavg" = lavg^ - (m - 1), 
where h = k + m — 1. 

Proof. The first equahty fohows from (12) and the definition (5) of cZ™. The others are then consequences 
of (15). □ 

3.4. Asymptotic determinism via conditional probabilities. Here we assume (4). For h,l > 1 and 

r > define the conditional correlation integral by 

c (r) = '^^+'('^) = /i X fi{(y,z) : dh+i{y,z) < r} 
'^'1'*'^ Ch{r) fi X ^i{{y,z) : dh{y,z)<r} 

Particularly, if /i is the distribution of an ergodic stationary process and Y^^^ , Zq are independent 
random vectors with the distribution equal to that of Xq^', then C;|/j(r) is the conditional probability 

Ci\u{r)^l^{Q\Yi,Z{)<r\ Q\Y,\Z^)<r). 

Thus, C;|^(r) is the probability that h consecutive recurrences are followed by at least / other ones. In 
view of this we have the following interesting expression of asymptotic determinism in terms of conditional 
probabilities. 

Theorem 10. Under (4), the asymptotic determinism can be expressed via a linear combination of condi- 
tional correlation integrals 

det™ = k ■ Ck-l\m - (fc - 1) • Cfc|m. 

Consider now the special case of (ergodic stationary) Markov processes of order p > I. Then for every 
m > p one has C/|„j = C;|p. That is, over-embedding has no effect on the asymptotic determinism. 

Corollary 11. For every (ergodic stationary) Markov process of order p > 1 and for every m > p, det™ — 
det^ 

4. Asymptotic RQA measures for some processes 

Now we present some applications of the asymptotic results obtained in the previous section. We assume 
that S = {S,g) is a separable metric space and S°° is equipped with the pseudometric given by (5), 
where m is the embedding dimension, k is the prediction horizon and the embedding metric g"^ is given by 
(4). We also assume that X^ is (a Kolmogorov representation of) an ergodic stationary 5"- valued process. 
In the following we derive explicit formulas for asymptotic RQA measures for some classes of processes. To 
make the paper self-contained we include here also the proofs, though the results (at least for correlation 
integrals) are known. The convergence is demonstrated by simulation studies. We start with the simplest 
case of iid processes. 

4.1. IID processes. 

Proposition 12. Let X(^ be an iid process. Then, for m,k > I and r > 0, 

c^(r) = a'"+'=-\ wherea = cir). 

Hence 

detl\r) = a^-^ [k - [k - 1) a] and lavg^(r) = fc + 

1 — a 

do not depend on the embedding dimension m. 

Proof. By Lemma 9 we may assume that m — \. Let Yq , Zq be independent random vectors with the 
distribution equal to that of Xq. Then, for every r > 0, 

Cfe(r) = ^i{gk{YQ^, 4) <r} = n{g{Y,, Z,) < r for every < i < k} 

= n l^MY^,Z,)<r} = [c{rt. 

0<i<k 

Thus the first statement is proved. The rest follows from the definitions (15) of det™ and lavg™. □ 
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parameter r parameter r parameter r 

Figure 1. Convergence of the empirical determinism (solid line) to the asymptotic 
determinism (dashed line) for an iid process with distribution N{0, 1); n — 100, 1 000, 10 000. 

For example, the asymptotic determinism of a Gaussian iid process with variance is 
det^(r) = 2fc [2$ (/)- 1]''"' • [1-2^ _$(/)] ^ where / 



V2a 

and $ is the distribution function of the standard normal distribution. To see this, use that for iid Gaussian 
random variables Y, Z with variance (t'^,Y - Z ^ N{0, 2a^) and so c(r) = fi{\Y - Z| < r} = 2$ (r') - 1. 

Figure 1 illustrates the convergence of the empirical determinism (with m — 1 and fc = 2) to the asymptotic 
one for a Gaussian iid process. 

4.2. Markov chains. Let S = {0, 1, . . . , q — 1} be a finite space equipped with the discrete metric g (that is, 
g{x, y) — 1 \l X ^ y and q{x, y) = for x = y). Consider an S*- valued Markov chain with the transition 
matrix P = {pst)'i~t=o and the stationary distribution tt = (ttq, . . . , 7rg_i)'. Recall that tt'P = tt' and that 
X^ is ergodic if and only if the matrix P is transitive or, equivalently, the probability of the transition 
from any state s to any state < in a finite time is non-zero. The formulas for asymptotic values of RQA 
characteristics of Markov chains are given in the following proposition. As in the iid case, also here we can 
see that both the determinism and mean diagonal line length do not depend on the embedding dimension. 
(Notice that, in this discrete setting, only the distance threshold r less than 1 needs to be considered and, 
for < r < 1, RQA quantities do not depend on r.) 

Proposition 13. Let X^ be a finite-valued Markov chain with the transition matrix P and the stationary 
distribution tt. Then, for r £ [0, 1), 

jk+m-2 Aai-^^f^') — flfc-l r;_ / 7, i\ oi — j i m/„^ 7, , 



c™(r) = def^'ir) ^ /3^~'[k-{k-l)P] and lavg^(r)==fc + 



1-/3 

where a — tt'tt and /3 = (7r'diag(PP')7r)/Q!. 

Proof. Only the equality for c™(r) needs a proof since the other two follow from (15). We may assume that 
m — 1. Let Yq, Zq be independent random vectors with the distribution equal to that of Xq. Fix r < 1 and 
put a = c(r); then a = fi{d{Yo, Zq) < r} = J2s f^{^o = Zq = s} = tt'tt. 

If A; = 1 we are done. So assume that k > 2 and put /3 = Ci|i(r) — ii{d{Yi, Zi) < r \ (i(Yoi Zq) < r}. Then 



/3 = -^i{Yo = Zo,Yl = Zl} = -y^,{Yo=^Zo^s,Yl=Zl=t} 
a a ^ — ^ 

- ^ m{^o ^s,X^=tY^-Y^ ttI -pI^- (^'diag(PP» . 



s,t s,t 



Since X^ is a stationary Markov chain, we obtain 

Cfe(r) = tl{dk{Yo\Z^)<r}^^I{Y,^Z,yO<^<k} 

= fi{Yk-i = Zk-i \Y,^ Z,yO<i<k-l}- Cfe_i(r) = /3cfc_i(r). 
Now a simple induction gives the desired result. □ 
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parameter k parameter k parameter k 



Figure 2. Convergence of the empirical determinism (solid line) to the asymptotic 
determinism (dashed line) for a 3-state Markov chain with the transition matrix P; 
n= 100,1000, 10 000. 



Figure 2 depicts the convergence of the empirical determinism (with to = 1) to the asymptotic one for a 

/ 0.362 0.438 0.200 

3-state Markov chain with the (randomly selected) transition matrix P = 0.484 0.447 0.069 

\ 0.120 0.503 0.377 

4.3. Autoregressive processes. Next we consider asymptotic RQA characteristics of a (stationary) au- 
toregressive process ~ AR{p) of order p > 1 with coefficients 9i {i — I, . . . ,p) and with Gaussian zero 
mean noise of variance a^. It is given by A„ — 9i ■ A„_i + 62 ■ A„_2 + ■ ■ ■ + dp ■ X^^p + e„. 

Proposition 14. Let X^ he an (ergodic stationary) autoregressive process AR(p) with coefficients 0i, . . . ,6p 
and Gaussian WN{0, ct^). Let r > 0, to, fc > 1 and h = k + m — 1. Then 

cr(r) = /i{yo'e hr,r]n, 

where Yf^ ^ N{0, E) with S being the h x h autocovariance matrix of an AR{p) process with coefficients 



. , 9p and Gaussian WN(0, 2a 



2^ 



Proof. Since the difference of two independent AR processes with the same parameters 9i, . . . ,9p,a is an 
AR{p) process with the same coefficients and noise variance 2a^, the statement immediately follows from 
(13). □ 




parameter r parameter r parameter r 



Figure 3. Convergence of the empirical determinism (solid line) to the asymptotic 
determinism (dashed line) for an AR{S) process with parameters ai = 0.25, 02 = 0.4, 03 = 
0.3 and = 1.5; n = 100, 1 000, 10 000. 

The convergence of the empirical determinism DETj to the asymptotic one for an AR process is exhibited 
in Figure 3. 
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Corollary 11 implies that over-embedding of an AR{p) process into dimension m > p leaves the deter- 
minism unchanged. Thus, the asymptotic determinism can be used to estimate (from below) the order of 
an autoregressive process. This is demonstrated in Table 2 on an AR{3) process. There one can see that 
embedding into dimension 4 or 5 gives the determinism equal to that for dimension 3, but the determinisms 
for TO — 1, 2 are smaller. Thus one can conclude that the order of the process is at least 3. 



m 

k 


1 


2 


CO 


4 


5 


2 


0.638 

(0.009) 


0.725 

(0.008) 


0.759 

(0.008) 


0.760 

(0.008) 


0.760 

(0.009) 


3 


0.407 

(0.010) 


0.491 

(0.011) 


0.514 

(0.012) 


0.515 

(0.013) 


0.516 

(0.014) 


4 


0.258 

(0.010) 


0.312 

(0.011) 


0.327 

(0.012) 


0.328 

(0.013) 


0.329 

(0.014) 


5 


0.158 

(0.008) 


0.191 

(0.010) 


0.201 

(0.011) 


0.201 

(0.012) 


0.202 

(0.013) 



Table 2. Determinisms for AR{3) process with ai — 0.25, 02 = 0.4, 03 = 0.3 and white 
noise's variance 1.5; here n ~ 2 500 and r — \/1.5. Average determinisms with standard 
errors in parentheses obtained by a Monte Carlo simulation of size 1 000. 



5. KOLMOGOROV ENTROPY AND ASYMPTOTIC DETERMINISM 

In the following three examples we demonstrate that behavior of the RQA determinism can sometimes be 
counterintuitive. First we show that the determinism of an iid process can be higher than that of a non-iid 
one with the same marginal. In the second example it is shown that higher entropy does not necessarily 
mean smaller determinism. Finally, a Markov chain indistinguishable (from the RQA point of view) from 
an iid process is constructed. 

Example 15 (Determinism of iid and non-iid processes). Fix < a, & < 1 and consider a 01-valued Markov 
chain with the transition matrix P — {pst)l t=o such that poo = ctjPii = b. Then is ergodic and the 

stationary distribution of it is given by tt = (^ 2~a-fc ' 2-a-b ) • ^'^-^ ^ ^ ['-'' Proposition 13, 

detTirr-^- ^ kp^-' - (fc - l)/3^ where /3 = c,i(r) = ^^-°'^-[^^^';r_^!;i:;i:^ • 
On the other hand, for a 01-valued iid process with the same marginal tt. Proposition 12 gives 
det^(r)"'i = fca'^-i - {k - l)a^ where a = ''^'^^Za-ly^^^' ■ 

If we take a — 3/5 and b — 1/5, then a > /3. Since the function x i-> kx^^^ ^ {k — is increasing on [0, 1], 
we have that det"(r)'^^''^°^ > det"(r)"'i for any to, fc > 1. 



Example 16 (Determinism and entropy). The previous example also shows that higher entropy does not 
necessarily mean smaller determinism. In fact, the entropy of an iid process is strictly larger than that of 
any stationary non-iid process with the same marginal. In this simple case the entropies can be calculated 
analytically, since the entropy of an iid process is /i'"^ = — tt^ logTr^ and the entropy of the Markov chain 
is h^'^'^^°^ = ^ Sst ^sPst logPst- See also Figure 4 for an illustration of this phenomenon. 

Example 17 (Indistinguishable Markov chain and iid process). In the 2-state Markov chain considered 
in Example 15, fix 6 = 1/5 and, for given a, denote by aa,Pa the corresponding correlation integrals 
Ci(f ), Ci|i(r), respectively. Since ai/2 < /?i/2 and 0:3/5 > (3^/5, there is a g (1/2,3/5) with = Pa- For this 
particular (non-iid) Markov chain X^ , the probability of finding a diagonal line of length k (in the infinite 
recurrence plot) is the same as that for an iid process with the same marginal. Hence, no RQA measure 
based on diagonal lines can distinguish between Xg^ , Fq°° . 
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- - DET_2forMC 

DET_2forllD 

Entropy for MC 

Entropy for I ID 



0.50 0.55 0.60 0.B5 O70 0.75 0.80 

parameter a 

Figure 4. Entropy versus determinism det2 for a 2-state Markov chain (MC) with b = 1/5 
and an iid process with the same marginaL 



6. Spurious structures 

In [17], see also [12, Section 3.2.4], it was pointed out that, for iid processes, over-embedding leads to 
existence of spurious structures in recurrence plots. The appearance of spurious structures is illustrated in 
Figure 5. The left panel depicts the "usual" recurrence plot of an iid process for the embedding dimension 
m = 1. On the right panel there is the recurrence plot for the embedding dimension m — 250. It contains 
long diagonal lines, which would suggest that the process should be well predictable. 




Figure 5. The effect of equal recurrence rates for embedding dimensions m — I and 250; 
uniform iid data. 

Proposition 12 enables us to explain why this happens. In fact, this is due to a special choice of the distance 
threshold r, which selects such r = Vm that the recurrence rate RR™(r,„) is fixed to a predetermined level. 
As the following proposition demonstrates, this selection rule leads to the determinism close to one and 
average diagonal line length arbitrarily high for large embedding dimensions. 

Proposition 18. Let be an iid process. Let > and let r„i > (m €N) be such that all the recurrence 
rates rr™(rm) are equal to 9. Then, for k > 1, 

lim det™(rm) — 1 and lim lavg™(rm) = oo. 

Proof. For m > 1 put a„i = c{r„i). Then, by the assumption and Proposition 12, 9 — rr™(r,„) ~ (am)"^ for 
every m; thus «,„ = — > l for m — !■ oo. Using Proposition 12 we obtain that, for every fc > 1, rr™(r,„) = 
(am)™"'"'^^^ ■ [k — {k — l)am] -> & for rn — >■ oo, and so lim^ det™(rm) = 1 and liuim lavg™(rm) = oo. □ 
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Hence, appearance of the spurious structures for iid processes is an artefact of this particular selection 
rule for density thresholds. The artificial "predictability" which appears on the right panel of Figure 5 is 
due to the distance threshold r, which is several times higher than the standard deviation of the process. 
Different selection rule, which chooses r independently of the embedding dimension, leaves the determinisms 
det™(r) and mean diagonal line lengths lavg™(r) constant for m cxi, as one expects for iid processes. 

7. Strong law for correlation sums on pseudometric spaces 

Here we give a proof of Theorem 3, based on the strong law for correlation sums on pseudometric spaces; 
see Theorem 19 below. Recall that, for a (topological) space Z, a map d : Z y. Z ^ M+ is a pseudometric 
on Z if d{x, x) = 0, d{x, y) = d{y, x) and d{x, z) < d{x, y) + d{y, z) for every x,y, z & Z . A pseudometric d 
is separable if the topology generated by it is separable. We say that d is a Borel (continuous) pseudometric 
on Z if it is a pseudometric which is Borel (continuous) w.r.t. the product topology on Z x Z . Notice that 
a continuous pseudometric on a separable space is automatically separable. 

If d is a pseudometric on Z, Bd{x,r) and Sd{x,r) denote the (closed) c?-ball and d-sphere with radius r 
centered at x, respectively. Notice that if d is Borel then d-balls and d-spheres are Borel sets in Z; to see it, 
use that Bd{x, r) = {y : {x, y) G d~^([0, r])} and analogously for Sd{x, r). The d-diameter of a set v4 C Z is 
denoted by Aia,md{A). 

Assume that {Z,Bz,fJ.,T) is a dynamical system and that d is a Borel pseudometric on Z. For x £ Z, 
n € N and r > define the correlation sum 

Gd{x,n,r) = ^card{(i,j) : 0<i,j < n, d{T {x) , (x)) < r} 

and the correlation integral 

Cd{r) ^ ^1 X fi{{x,y) : d{x,y)<r}= / nBd{x,r) dfi{x). 

J z 

Recall that Cd is non-decreasing, right continuous and tends to 1 if r — > oo. Further, c^ is continuous at r if 
and only if fJ,Sd{x, r) — for ^-a.e. x £ Z, see e.g. [13, Remark 2.2]. 

The strong law for the correlation sum was studied under different conditions in [14, 13, 1, 15, 11]. Though 
not stated in this form, the following theorem was proved in [11]. 

Theorem 19. Let Z be a topological space, ^ be a Borel probability on Z and T : Z ^ Z be a ^-ergodic 
Borel map. Let d be a separable Borel pseudometric on Z . Then, for /i-a.e. x £ Z and for every r > 0, 

lim Cd(x, n, r) = Cd{r) > 

n— f oo 

provided Cd is continuous at r. 

Let us note that this "pseudometric" version of the strong law for correlation sums cannot be directly 
derived from the "metric" one. Indeed, it is true that one can easily obtain a metric space from the 
pseudometric one by gluing together points of zero distance, as is usually done. The considered dynamical 
system, however, does not necessarily fit to this projection and so, in general, there is no induced system on 
the obtained metric space; take e.g. the case when Z = K°°, T is the shift and d{x^,y^) — \xo — yo\. 

The proof from [11], however, perfectly fits to this general setting, as was noted by the authors. Indeed, 
it is based on the Birkhoff crgodic theorem and on the existence of finite Borel partitions A™ = {AJ^ : 
< j < Mm} (m > 1) with Ai(^o") < 2"" and diam<i(A™) < 2~" for every j > 1. Since the former is 
true for arbitrary ergodic system and the latter immediately follows from separability of (Z, d) and Borel 
measurability of d-balls, the convergence in Theorem 19 can be proved using the same reasoning as in [11]. 
Finally, the fact that Cd{r) > for every r > is obvious due to separability of (Z, d). (To see it, take any 
d-ball B with radius r/2 and n{B) > and use that Crf(r) > fi{B)^.) 

Next we show how Theorem 3 can be obtained from Theorem 19. 

Proof of Theorem 3. Let {S, g) be a separable metric space, m, A: > 1 be integers and r > be such that c™ 
is continuous at it. Let g"^ be a metric on S"" compatible with the product topology. Put Z = S°° and 
define d = d™ by (5). Then obviously d is a continuous pseudometric on Z; it is separable due to separability 
of Z. 
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Let be an iS-valued ergodic stationary process with distribution /i. We may assume that is given 
by its Kolmogorov representation, that is, X„ = vr o T", where T : Z ^ Z is the shift and tt : Z — > 5* is the 
projection xjf' i— >■ xq. Then, for every x = xf^ G Z and every n, T"(.t) = x'^ and so 

c^(r)=Crf(r) and GZ\x,n,r) ^ Cd(a;, n - fc + 1, r). 

Apphcation of Theorem 19 to the ergodic system {Z, Bz, IJ,, T) gives the desired result. □ 
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